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SIGNIFICANCE-BASED DISPLAY 
BACKGROUND OF THE INVENTION 

5 

Field of the Invention . 

The present invention relates to techniques for displaying data in graphical 
format, and particularly relates to the display of data having varying statistical 
significance. 

10 

Description of the Related Art . 

For a long time now, it has been recognized that displaying information 
graphically can be tremendously beneficial to the end user, allowing the end user to 
quickly absorb and identify relationships among large amounts of data. Bar graphs, 

15 chart graphs, scatter point graphs and various other types of graphs, when used in 
the appropriate context, can all provide such advantages. 

Despite the long-time use of graphical displays, the present inventors recently 
have recognized a significant shortcoming in conventional graphing techniques. 
Specifically, the present inventors have discovered conventional graphs often display 

20 various data points having varying levels of statistical reliability, but lack a means for 
effectively conveying that information to the end user. 

SUMMARY OF THE INVENTION 

The present invention addresses this need by providing a display in which a 
25 display characteristic of the displayed data points is a function of a measure of the 
statistical significance of the data point. 

Thus, in one aspect the invention is directed to displaying information. 
Initially, a number of data points is obtained, each data point including an estimated 
statistic. A measure of the statistical significance for each estimated statistic also is 
30 obtained, and a graph of the data points is displayed. According to this aspect of the 
invention, each data point is displayed at an intensity level that is a function of the 
measure of statistical significance of the estimated statistic included in such data 
point. 

1 
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By displaying data points at an intensity that corresponds to the statistical 
significance of the data point, the present invention can effectively and conveniently 
indicate to a user differences in statistical significance among various displayed data 
points. More specifically, using display intensity as a way to differentiate the 

5 statistical significance of different data points has been found to have highly intuitive 
value to the end user. 

Such a technique is particularly applicable to displaying sensitivities or 
elasticities of a variety of assets in a graphical format. Moreover, by using a non- 
linear mapping of statistical significance to display intensity, the present invention 

10 often can more effectively highlight particular regions of statistical significance that 
are of interest to the end user. 

In a further aspect, the invention is directed to displaying information. Initially, 
a number of data points is obtained, each data point including an estimated statistic. 
A measure of the statistical significance for each estimated statistic also is obtained, 

15 and a graph of the data points is displayed. According to this aspect of the invention, 
a display characteristic (such as size, brightness, hue, saturation or any other color 
characteristic) of each data point is a function of the measure of statistical 
significance of the estimated statistic included in such data point. 

Indicating statistical significance of data points by changing such display 

20 characteristics often can provide many of the same benefits described above for 
indicating statistical significance using display intensity. In particular, display 
characteristics other than intensity may be especially useful where a color display is 
used. For example, altering the saturation of a displayed data point in a color display 
often will have a similar visual impact as altering the intensity of the displayed data 

25 point in a grayscale display. 

The foregoing summary is intended merely to provide a brief description of the 
general nature of the invention. A more complete understanding of the invention can 
be obtained by referring to the claims and the following detailed description of the 
preferred embodiments in connection with the accompanying figures. 

30 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a flow diagram for explaining evaluation and screening of assets 
according to a representative embodiment of the invention. 

Figure 2 is a flow diagram for explaining portfolio evaluation and screening 
5 according to a representative embodiment of the invention. 

Figure 3 illustrates display of asset elasticity information according to a 
representative embodiment of the present invention. 

Figure 4 is a block diagram of a general-purpose computer system, 
representing one suitable computer platform for implementing the methods of the 
10 present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The following disclosure pertains to multiple inventions that are claimed in 
separate patent applications. The commonly assigned patent applications filed of 
15 even date herewith and titled, "Identifying Industry Sectors Using Statistical 
Clusterization" and "Sensitivity/Elasticity-Based Asset Evaluation and Screening" are 
incorporated herein by reference as though set forth herein in full. 

Asset Evaluation and Screening . 

20 The present invention provides asset evaluation and screening techniques 

that may be incorporated into an asset evaluation/screening tool for use in portfolio 
management and financial planning. In the preferred embodiments, the techniques 
of the present invention create a model for predicting the value of an asset (such as 
a stock) based on various exogenous variables. The model is generated by using 

25 historical data for the value of the asset and for the exogenous variables. Similar 
models are then created for a pool of other assets. Such models can then be used 
to perform "what if analysis, allowing a user to input various scenarios and then 
obtain information as to how various characteristics of a specified asset will change. 
In addition, the techniques of the present invention can permit asset screening based 

30 on such characteristics. 

Figure 1 illustrates a flow diagram for explaining asset evaluation and 
screening according to a representative embodiment of the present invention. 
Briefly, according to Figure 1 , historical data are input for an asset; a price formula 

3 

0248178.1 



35512-00035 



is determined for the asset based on the input historical data; the foregoing steps are 
repeated for different assets; scenario values are then input for certain exogenous 
variables; value(s) are calculated for selected asset(s) based on the input scenario; 
a tendency of the asset value to change is calculated, based on the input scenario; 
5 any desired screening is performed; the calculation of tendency of asset value(s) to 
change and screening steps may then be repeated for alternate exogenous variable 
scenarios; based on results of the foregoing evaluation and screening, asset 
holdings are adjusted; in addition, the models periodically are updated to incorporate 
new historical data. 

10 In more detail, in step 22 historical data are input for the first asset. Such 

information preferably includes measurements of the value of the asset and other 
data that relate to general macroeconomic conditions, and also may include other 
information that is more specific to the asset. In the preferred embodiment of the 
invention, a list of such variables is specified and the data value for each variable is 
15 input at each of plural specified points in time over an extended time interval. For 
example, values for all variables at a predetermined time each day (e.g., the close 
of business, Pacific Time) may be input for each business day in a previous time 
period T, where T may be any time period but preferably is at least 30 days in length, 
in order to obtain a statistically meaningful sample. For example, T may be 180 
20 days, 1 year, 2 years, 3 years, 4 years or even longer. Currently, it is preferable to 
use a time period T of the immediately preceding 3 years. 

Examples of the types of general macroeconomic data that may be included 
are any or all of the following: Federal Funds Rate Daily; 1 -year Treasury Bill Rate; 
10-year Treasury Constant Maturity Rate Daily; 30-year Treasury Constant Maturity 
25 Rate Daily; Moody's Seasoned Baa Corporation Bond Yield Daily; Consumer Debt 
Service Payments as Percent of Disposable Personal Income; Corporate Net Cash 
Flow; Net Foreign Investment; Total Consumer Credit Outstanding, Not Seasonally 
Adjusted; Trade Weighted Exchange Index: Major Currencies; Total Business 
Inventories: Manufacturers, Retailers & Merchant; Inventory/Sales Ratio: Total 
30 Business; Manufacturers New Orders: Non-defense Excluding Aircraft & Parts; Retail 
Sales, Not Seasonally Adjusted; New Privately Owned Housing Units Started: 
Structures with One Unit; Total Industrial Production Index; Oil Price Domestic: West 
Texas Intermediate; Real Gross Domestic Product in Chained 1996 Dollars; Real 
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Gross Private Domestic Investment; National Defense Consumption Expenditures 
& Gross Investment; Real Nonresidential Investment: Equipment & Software; Real 
Net Exports; Consumer Price Index (CPI) for All Urban Customers; CPI - Energy; PPI 
- All Commodities; Money Stock; Adjusted Monetary Base; St. Louis Adjusted 

5 Monetary Base; NAPM: Composite Index; Composite Index of Leading Indicators, 
1992=100; University of Michigan: Consumer Sentiment; University of Michigan: 
Inflation Expectation; Compensation of Employees; Unemployment Rate; and/or 
Median Duration of Unemployment. 

Examples of the types of information specific to the asset that may be input 

10 in step 22, in the case that the asset is a share of stock, or is related to the value, 
return or some other characteristic of a share of stock, include: (i) any of a variety of 
trading information regarding the stock, such as the stock price, stock trading 
volume, volatility of the stock price, trading price of options on the stock, information 
pertaining to analyst recommendations, and/or any of the foregoing information 

15 normalized with respect to data for either similar stocks (e.g., stocks in the same 
sector) or the market as a whole; and/or (ii) any of a variety of information pertaining 
to the company that issues the stock, such as industry classification, number of 
employees, any or all of the company's financial information (e.g., book value, 
amount of debt, debt/equity ratio, amount of profits amount of revenues or types of 

20 assets), usage rates of particular raw materials, employee and/or management 
turnover rates, and/or information pertaining the company's amount and/or type of 
diversification. 

The foregoing macroeconomic, asset-specific (other than the measurement 
of value of the asset) and/or sector-specific variables are referred to herein as the 

25 exogenous variables. In addition, the exogenous variables may include not only 
financial and economic data (such as those listed above), but any other type of data 
as well. For example, it is possible to include exogenous variables whose data 
values pertain to population, climate, popular tastes or sentiments, political 
environment, current mass media content, and/or any other social, environmental or 

30 physical conditions. 

Moreover, in addition to inputting actual historic data values, the exogenous 
variables may include forecasts of any economic or financial data (such as forecasts 
of any of the above-mentioned data) or even forecasting errors. With respect to 
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forecasts and/or forecast errors, the data value for any forecast or forecast error may 
be deemed "current" (for purposes of data input) either at the time the forecast is 
made, as of the date/time with respect to which the forecast is made, or at any other 
arbitrarily selected time. 
5 In general, it will be desirable that for each time point at which data are 

entered, current data values for all exogenous variables used should be input. To 
the extent that current information for any such exogenous variable is not available 
at any such given point in time, it generally will be preferable to use the most recent 
data for such exogenous variable. For example, certain economic data may be 

10 announced only monthly or only quarterly. If this is the case, the most recently 
announced data value is used until the next announcement. Particularly in cases 
where the announcement of any data value is significantly less frequent than the 
desired frequency of data input (but also in any other cases as well), it may be 
preferable to include an additional exogenous variable that specifies how current the 

15 data value for one of the other exogenous variables is. 

In this regard, certain announced data may be indicated as being valid for only 
a specific previous historical period. For instance, a certain measurement of the 
unemployment rate for July may not be announced until late August. In such a case, 
it is preferable to use the announced unemployment rate for all data input times that 

20 fall in July. To the extent such unemployment rate information is required for August 
but is not yet announced, it is preferable to use the announced July rate for all data 
input times that fall in August (with or without a seasonal adjustment factor), together 
with an additional exogenous variable that indicates the duration of time since the 
effective date of the last announcement (e.g., a variable indicating the date in 

25 August). 

When data are subsequently input for other assets (as described below) much 
of the data previously input for the exogenous variables may be reused. However, 
to the extent that different data input times are used for different assets, it might be 
necessary to input new data values. To the extent that the data input times are 
30 identical (or are at least close enough in time that new announcements have not yet 
been made), the following is a list of the types of data that typically may be reused 
for different assets: the general economic data; the data that are not related to 
economic or financial factors; any industry-specific data, provided that two securities 
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are issued by companies in the same industry sector (however defined); and any 
other data that is not unique to one asset relative to the other. 

In the preferred embodiment of the invention, the exogenous variables include 
only financial, economic and/or other types of data that are not particularly 
5 associated with any individual asset, any single class of assets or any single industry 
but instead affect various assets in various classes and industries. More preferably, 
the exogenous variables include the specific general economic information listed 
above. As a result, provided that the data input time points are the same for different 
assets, once a set of data has been input for the first time, only an indicator of asset 

10 value will have to be input for each additional asset. 

In step 24, a price formula is determined for the current asset based on the 
data input for the asset in step 22. The determined price formula relates the 
indicator of the value of the asset that was input in step 22 to the exogenous 
variables input in step 22. If Y represents the value of the asset and X represents 

15 a vector containing the data values for the exogenous variables that are to be used 
in estimating Y, then the relationship between Y and X can be expressed as: 

Y = fpQ + U (Eq. 1) 

20 where U is independent of X and, by including an appropriate constant term in f(X), 
can be assumed to be zero-mean. As a result, the expected value of Y equals the 
expected value of f(X), which can be stated algebraically as follows: 

E{Y}= E{f(X)} (Eq.2) 

25 

Eq. 1 can be expanded, for example, using a Maclaurin series expansion. In 
the simplest case of such an expansion, in which X consists of a single variable, Y 
can be expressed as follows: 

30 Y = p0 + pl*X + p2*X 2 + p3*X 3 + ... + U (Eq. 3) 

where U is the approximation error and is independent of X. In order to obtain a 
practical representation of Y, the infinite series represented by Eq. 3 is truncated. 
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Although Eq. 3 can be truncated at any point, it is presently preferable to truncate 
Eq. 3 by eliminating all powers of X greater than 2 or 3. As a result, U generally can 
be assumed to be uncorrelated with X. Eq. 3 above also can easily be modified to 
express the more general case of a Taylor series expansion. 
5 When X consists of multiple variables X i? the Maclaurin series expansion will 

include the higher order terms of the various X { as well as cross-product terms, such 
as Xj Xj. For example, the second order Maclaurin series expansion is given as: 

N N N 

J=l l=\ J=l 

10 

where N is the number of exogenous variables X,. In Eq. 4, each is the first order 
partial derivative of Y with respect to Xj evaluated at the origin, and the are the 
mixed partial derivatives of Y with respect to X { and Xj evaluated at the origin. Of 
course, higher order Maclaurin series expansions are also possible (e.g., third order). 

15 In addition, Eq. 4 also can easily be modified to express the more general case of 
a Taylor series expansion. 

In order to determine the price formula for predicting the value of an asset, it 
is necessary to determine the coefficient values (e.g., in the case of a second order 
expansion, values for a and for all and c^) in the above-described Maclaurin or 

20 Taylor series expansion. Such values can be determined in any of a variety of ways. 
In one embodiment, the coefficients are calculated using a statistical regression 
technique, such as by minimizing the total of some function of the error (e.g., 
magnitude of error or squared error) between each actual data point and the point 
predicted by the resulting formula. Such techniques are well known in the art and 

25 therefore are not discussed in detail here. 

Although the preferred embodiment of the invention uses a Taylor series 
expansion representation for Y, any other predefined parametric equation may used 
instead, such as a Fourier series expansion or similar frequency-space 
transformation. In any event, the parameters for any such predefined parametric 
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equation generally can be determined in a similar manner to that described above, 
e.g., minimizing the total of some function of error. 

As a still further alternative, it is possible to determine a price formula where 
the format of the equation is not predefined, but rather determined dynamically 
5 based on the input data. The preferred method for implementing such a solution is 
to use a neural network technique. As is well known in the art, neural networks 
typically operate by propagating data throughout a network of nodes, with a weight 
applied to each data element as it propagates from one given node to another given 
node. 

10 The neural network is trained to produce the correct response by inputting 

sample input-output pairs, observing the system's actual output in response to the 
sample input, and comparing such actual outputs to the provided sample outputs. 
The training algorithm then adjusts the weights between the nodes, and may even 
create and/or delete nodes, based on the results of the comparison. Any of 

15 numerous different training algorithms may be used, such as a genetic algorithm. 
Also, by restricting creation and deletion of nodes, a neural network may also be 
used to identify coefficients in the above-described simpler case of a predefined 
parametric equation. 

In the present case, such a neural network is trained using the data input in 

20 step 22 to provide the appropriate price (or other measurement of value) for the 
asset as a function of the data values for the exogenous variables. Once training 
has been completed using such data, the structure and weighting coefficients of the 
neural network are fixed and define a formula that provides an estimate for the value 
of the asset in response to an input of any data values for the set of exogenous 

25 variables. 

In the examples given above, the price formula expresses the actual value of 
the price (or other measure of value) of the subject asset as a function of the actual 
data values for the exogenous variables. Such a formulation lends itself will to 
determining the sensitivity of Y to each variable making up X (i.e., X;) because in this 
30 case the partial derivative of Y with respect to X ; gives the sensitivity of the value of 
the asset to the exogenous variable corresponding to Xj. However, in the preferred 
embodiment of the invention, the price formula is expresses a logarithm of the value 
of an asset as a function of logarithms of the exogenous variables. In this alternative 
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formulation, the partial derivative of Y with respect to Xj gives the elasticity of the 
value of the asset to the exogenous variable corresponding to X ; . 

Also, in calculating the price formula as described above, it is possible to treat 
all data points equally. Alternatively, it may instead be preferable to weight more 

5 recent observations more heavily than those observations that are more remote in 
time. In addition, as both the measure of the subject asset's value and the data 
values for the exogenous that are used typically will only be estimates of the actual 
values and data values, respectively, in certain cases it may be preferable to more 
heavily weight those observations that are known with more certainty (e.g., lower 

10 variance). 

In step 26, a determination is made as to whether price formulas have been 
calculated (in step 24) for all of the assets of interest. It is noted that it may be 
desirable to perform step 24 for all assets for which data have been input (i.e., all 
assets in the tool's database) or for only the subset of such assets that are of interest 

15 to the current user. If the determination in step 26 is affirmative, processing 
proceeds to step 30. If not, processing proceeds to step 28 to input historical data 
(preferably reusing previously input data to the extent possible, as discussed above) 
for the next asset and to calculate a price formula for that asset in step 24. 

In step 30, values for the exogenous variables, collectively comprising a 

20 particular scenario, are input. Typically, a user will manually input such a scenario. 
However, some or all of the data values comprising the scenario may be generated 
automatically, such as maybe provided by a separate forecasting system, e.g., using 
any of the techniques described in commonly assigned U.S. patent application serial 
numbers 09/392,361 , 09/391 ,765, 09/392,109, 09/391 ,962, 09/391 ,534, 09/392,106, 

25 or 09/391 ,764, filed September 8, 1999, or 09/494,200, filed January 28, 2000, all 
of which are incorporated herein by reference as though set forth herein in full. 

There are many different techniques for inputting a scenario. For example, 
in one embodiment of the invention, a data value for each variable is separately 
input. In an alternative embodiment, default values have already been entered for 

30 the exogenous variables, and therefore it is only necessary to replace those default 
values as desired. Preferably, the default data value for each exogenous variable 
(i.e., the data value to be used if no other data value is provided for such variable) 
is the most currently available data value for the variable. In a still further 

10 
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embodiment, only changes in the default values are required to be input, with the 
default change value being zero. It is noted that such changes may be input as 
either the actual expected difference from the default value or as the expected 
percentage change in the default value. Still further, it is possible to give the user the 
5 option as to which input method to use. Regardless of how the scenario is initially 
input, the tool according to the present invention preferably converts such inputs into 
a complete set of data values for the exogenous variables for use in the processing 
described below. 

Due to the interrelationships among the exogenous variables, when a change 
10 in a data value for one of the exogenous variables is input, it may be preferable in 
certain embodiments of the invention to automatically account for the changes 
expected in the other exogenous variables as a result of such input change. More 
details regarding such a feature and the tradeoffs pertaining to incorporating such 
a feature are described in connection with the discussion of step 34 below. 
15 In step 32, the value of each asset under consideration is determined, 

preferably by plugging the data values for the exogenous variables input in step 30 
into the price formula for such asset calculated in step 24. The price formula 
typically can be calculated in a straightforward manner by replacing the exogenous 
variables with the corresponding scenario data values and then calculating the result 
20 of the formula. In an embodiment in which a neural network (or similar network- 
based solution) is used, the data values for the exogenous variables typically can be 
provided as the inputs to the network, with the network output being the asset value 
estimate. 

In step 34, the tendency(ies) of one or more of the asset value(s) to change 
25 as a result of change(s) in one or more of the exogenous variables are calculated, 
based on the input scenario. Preferably, such tendencies will be sensitivities and/or 
elasticities of the value of the asset to one or more of the exogenous variables. 
However, any other measures of tendency to change may instead (or also) be 
calculated. 

30 Ordinarily, in the case where a predefined parametric equation is used, these 

calculations will mainly involve taking or estimating partial derivatives of the price 
function with respect to the exogenous variables of interest. In particular, when the 
price formula is determined from a pre-defined parametric equation, a closed-form 
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solution for each such partial derivative often can be determined in advance and then 
stored. For instance, assume that Y is given as a second order polynomial function 
of X, that X consists of only two variables, X, and X 2 , and that Y represents the 
actual value of the asset and the X { represent the actual data values for the 
5 exogenous variables. In this case, the sensitivity of Y to X x is found by taking the 
partial derivative of Y(as expressed in Eq. 4 above) with respect to X Y and therefore 
is given as: 

4/ &x = h + b 2 (<* 2 /<%i ) + Cyl x 2 + c, 2 x, {<% 2 / ) + 2cu x x + 2c 22 x 2 {<% 2 M ) ( Eq . 5) 

10 

In Eq. 5, the b and c coefficients were determined in step 24 and the values 
for X! and X 2 were input in step 30. The only remaining value to be supplied is the 
partial derivative of X 2 with respect to X,. This value can be assumed to be zero if 
Xj and X 2 are known to be largely independent of each other or, subject to the 

15 considerations described below, may be arbitrarily assumed to be zero. Otherwise, 
the relationship between Xj and X 2 can be determined by performing a linear or non- 
linear regression technique using historical data for the two corresponding 
exogenous variables, by performing a neural network technique using such data to 
train the network, or in any other manner. Regardless of which technique is used, 

20 it is preferable also to evaluate the statistical significance of the correlation between 
Xj and X 2 and then to assume that dX 2 /dX! is zero if such statistical significance is 
less than a specified (e.g., predetermined) threshold. 

In general, in order to obtain a closed-form solution of each partial derivative 
of any price formula that is expressed as a polynomial expansion, it typically will be 

25 necessary to either: (i) evaluate the partial derivative of each exogenous variable with 
respect to each other exogenous variable and also to evaluate the statistical 
significance of each such partial derivative; or (ii) assume that such partial 
derivatives are equal to zero. 

In certain cases, a closed-form solution of the partial derivatives cannot easily 

30 be obtained. For example, this situation is likely to occur when a neural network is 
utilized or even in certain cases involving more complicated pre-defined parametric 
equations. In such cases, it is possible to obtain an estimate of the instantaneous 
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derivative directly from the price formula obtained in step 24. In one example, such 
an estimate is obtained by observing the value of the asset calculated in step 24 
using the scenario input in step 22, slightly changing the data value of one of the 
exogenous variables (e.g., by 1% of its previous value), and then calculating the 

5 change in the asset's value divided by the change in the data value for the 
exogenous variable. 

On the other hand, recognizing that a change in one exogenous variable 
might correlate with changes in one or more other exogenous variables, an 
alternative embodiment of the present invention estimates the instantaneous 

10 derivative of the price formula in such cases by taking into account any changes that 
are likely to occur in the other exogenous variables as a result of the small change 
in the one exogenous variable. To accomplish this, in one embodiment of the 
present invention the partial derivative (e.g., sensitivity) of each exogenous variable 
with respect to each other exogenous variable and the statistical significance of each 

15 such partial derivative are calculated, such as described above. Then, the effect of 
a slight change in the data value of one of the exogenous on the data values of the 
other exogenous variables can be readily calculated. Accordingly, the calculated 
changes in such other exogenous variables are applied, as well as the change in the 
data value for the subject exogenous variable, and the resulting new set of data 

20 values is input into the network (or plugged into the price formula) to calculate a new 
value for the asset. By dividing the change in the asset value by the change in the 
data value for the subject exogenous variable, it may be possible to obtain a more 
complete measure of the tendency of the asset value to change as a result of a 
change in a particular one of the exogenous variables. 

25 As indicated above, two distinct approaches exist for determining the 

tendency of an asset value to change as the result of a change in the data value for 
an exogenous variable. In the first approach, the sensitivity of each exogenous 
variable to each other exogenous variable is ignored (i.e., the exogenous variables 
are treated as being independent). In the second approach, the sensitivities of the 

30 exogenous variables to each other are taken fully into account in determining the 
tendency of the asset value to change as a result of a change in one of the 
exogenous variables. The particular approach selected typically will depend upon 
the needs of the user. 
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With the first approach, the user generally will be required to account for 
correlations between the exogenous variables in some other way, such as in 
connection with subsequent processing of the various price sensitivities, elasticities 
or other measurements of tendency to change. On the other hand, with the second 
5 approach, in subsequent processing the user typically must recognize that certain 
changes in the exogenous variables have been automatically anticipated; therefore, 
such subsequent processing: (i) generally must attempt, for each exogenous 
variable, to estimate only that portion of the change that would not have been 
predicted by previously entered changes in the other exogenous variables; and (ii) 
10 may be required, in certain circumstances, to back out redundantly reflected 
relationships among the exogenous variables. 

It should be noted that similar considerations and tradeoffs in determining 
whether to reflect expected changes in related exogenous variables may also be 
made in connection with the inputting the various projected scenarios in step 30 
15 (discussed above). Also, it is possible to make either option available to the user 
(which options may be made available independently for steps 30 and 34) and let the 
user select the appropriate option to use for each application (e.g., by selecting a 
corresponding configuration setting). 

As noted above, in the event that the price formula obtained in step 24 relates 
20 actual value of the asset to actual data values for the exogenous variables (i.e., Y 
represents the value of the asset and X ; represents the data value of the 
corresponding exogenous variable), then simply estimating a partial derivative of Y 
with respect to Xj will provide the sensitivity of the asset value to such exogenous 
variable. Obtaining the elasticity of the asset value to such exogenous variable in 
25 this case will require calculating (X i fY)*(6Y/dX i ). On the other hand, if the price 
formula obtained in step 24 relates the logarithm of the asset value to the logarithms 
of the data values for the exogenous variables (i.e., Y represents a logarithm of the 
value of the asset and Xj represents a logarithm of the data value of the 
corresponding exogenous variable), then simply estimating a partial derivative of Y 
30 with respect to X, will provide the elasticity of the asset value to such exogenous 
variable. 

In step 36, any desired asset screening is performed. According to the 
present invention, such screening can be based on the scenario-based estimates of 
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asset values and/or tendencies of the asset values to change in response to 
changes in the exogenous variables (e.g., sensitivities or elasticities), calculated 
above, instead of or in addition to the factors conventionally used for screening 
stocks and other assets. 
5 For example, a user may in step 30 input a scenario in which the 

unemployment rate, inflation rate and the price of crude oil change in specific 
amounts, but all other exogenous variables remain at their default values (or, 
depending upon the system configuration and possibly the option settings selected 
by the user, change in the amounts expected based on the changes specified for 

10 those three exogenous variables). After steps 32 and 34 have completed for all 
assets desired to be searched, the user may then in step 36 search for all stocks that 
have increased in value by at least a specified percentage and that have price 
elasticities to the Japanese exchange rate and to Gross National Product that are 
within a specified range. The user may further limit the search to only those stocks 

15 issued by companies that have fewer than 500 employees. In fact, assets can be 
screened in this manner based on any combination of projected asset value under 
the specified scenario, sensitivity or elasticity to any exogenous variable(s) given the 
specified scenario, and/or any other information that has been input or derived for 
assets in the database (e.g., any of the information conventionally used for asset 

20 screening). 

It is noted that it is not necessary to calculate a value for each asset in step 
32 and a tendency of asset value to change for each asset with respect to each 
exogenous variable in step 34. Rather, steps 32 and 34 may instead be performed 
only to the extent needed in connection with a user's analysis of particular assets or 

25 in connection with screening over an identified group of assets. For example, it may 
be more efficient in the example given above to first identify those companies that 
have fewer than 500 employees in the database and then calculate the asset values 
in step 32 only for those companies and calculate tendencies of the asset values to 
change in step 34 only for those companies and only with respect to the Japanese 

30 exchange rate and to the Gross National Product. 

In step 37, the user's holdings are adjusted based on the results of the 
analysis in steps 32 and/or 34 and/or based on the screening in step 36. For 
example, after determining a projected value and projected elasticities for an 
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individual stock based on an input scenario, a user may decide to sell some or all of 
the stock, short sell the stock, purchase the stock, purchase or sell an option on the 
stock, purchase or sell another derivative instrument whose value is based on a 
characteristic of the stock, and/or initiate any other purchase, sale or other economic 

5 transaction to meet the user's financial goals. Such decisions may be: (i) made 
solely by the user based on the above-described information provided by the 
evaluation/screening tool of the present invention; (ii) recommended to the user by 
the evaluation/screening tool by including within the tool capabilities for a user to 
input the user's financial goals and process steps for automatically screening stocks 

10 and/or other financial assets to attain those financial goals (which recommendations 
may be provided by the evaluation/screening tool to the user with or without the 
underlying data on which such recommendations were made); and/or (iii) performed 
automatically by the evaluation/screening tool without user input after evaluating the 
user's financial goals and performing any indicated screening (e.g., according to 

15 predetermined process steps). 

Options (ii) and (iii) above require the evaluation/screening tool of the present 
invention to include additional analytical functionality, typically directed toward 
making the tool more user-friendly. However, such functionality generally is relatively 
straightforward to implement. For instance, assume that a user has indicated that 

20 he wants to maximize growth within a specified time horizon, subject to the condition 
that risk should be limited with respect to certain specified exogenous variables. In 
this case, the tool preferably would search the stocks in the database and sort such 
stocks into groups having negative, positive and approximately zero elasticities to 
each of the exogenous variables; calculate the expected returns to each such stock; 

25 and then construct a portfolio, possibly using an iterative technique, that balances 
the elasticities to within the specified limits while achieving the maximum possible 
return. Depending upon whether option (ii) or option (iii) is being implemented, the 
resulting trades required to achieve that portfolio are then either recommended to the 
user or initiated automatically by the tool. The process steps which determine which 

30 trades to make may also be supplemented to account for tax implications and/or 
trading costs. 

Although step 37 is shown in Figure 1 and discussed above as being 
performed after step 36, it should be understood that step 37 may also or instead be 
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performed at various points in the process, such as immediately after step 32 or 
immediately after step 34. 

In step 38, a determination is made as to whether any additional scenarios 
need to be tested. In the preferred embodiment of the invention, the user simply 

5 indicates whether or not he or she would like to input another scenario. However, 
if the tool according to the present invention is incorporated into a more extensive 
financial or economic analysis system, another program or system might provide this 
indication. If analysis of an additional scenario is desired, then processing returns 
to step 30 to input data values for the exogenous variables, if not, then processing 

10 proceeds to step 40. 

In step 40, a determination is made as to whether the model needs to be 
updated. In the preferred embodiment of the invention, the price formulas are 
recalculated periodically using historical data over a rolling period of time. For 
example, the price formula may be generated using data over the past three years 

15 and recalculated each business day. When it is time to update the model, 
processing returns to step 22 to input historical data for the first asset, together with 
historical data values for the contemporaneous exogenous variables. When step 22 
is being repeated, as contrasted with the first time the entire process is executed, it 
generally is not necessary to input the entire data set. Rather, only the new data 

20 generally need to be added and the old data (outside the rolling period) deleted from 
the data set upon which the price formula is calculated. In addition, weights may be 
reassigned to reflect the relative recency of each data entry in the thus newly formed 
data set. 

Figure 2 illustrates a flow diagram for evaluating/screening portfolios 
25 according to a representative embodiment of the present invention. In general, much 
of the processing for evaluating/screening portfolios will be similar to that used for 
analyzing/screening individual assets, but with certain additional functionality. In fact, 
in the embodiment described below, both types of functionality are provided. 

In step 122, historical data for a first asset are input. This step is essentially 
30 identical to step 22 described above. 

In step 124, a price formula is determined for the current asset. This step is 
essentially identical to step 24 described above. 
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In step 126, a determination is made as to whether a price formula has been 
calculated forthe last asset to be processed. As with step 26, discussed above, step 
124, may be performed for all assets for which data have been input or only a subset 
of such assets that are of interest to the current user. If the determination is 
affirmative, processing proceeds to step 130. If not, processing proceeds to step 
128 to input historical data (preferably reusing previously input data to the extent 
possible, as discussed above) forthe next asset and to calculate a price formula for 
that asset in step 124. 

Steps 128 and 130 are essentially identical to steps 28 and 30, respectively, 
as such steps are described above. 

In step 131, composition information is input for one or more portfolios of 
interest. Preferably, the input portfolio composition information includes the type and 
quantity of each asset (e.g., type of stock and number of shares). Such information 
may be input directly by a human user via a user interface (e.g., a graphical user 
interface) or may be input by another computer program or system operating in 
conjunction with the asset evaluation/screening tool of the present invention. 

In step 132, asset values are calculated. With respect to individual assets, 
such as individual stocks or individual commodities, this step is essentially identical 
to step 32 described above. However, in addition to allowing the user to obtain the 
value of individual assets, in this embodiment of the invention step 132 also allows 
the user to obtain the value of the portfolios defined in step 131 under the scenario 
input in step 130. Such a portfolio value preferably is obtained by summing the 
values of the assets included within the subject portfolio. 

In step 1 34, the tendencies of asset values to change in response to changes 
in the exogenous variables are calculated. With respect to individual assets, such 
as individual stocks or individual commodities, this step is essentially identical to step 
34 described above. However, in addition to allowing the user to obtain measures 
of the tendencies of the values of individual assets to change, in this embodiment of 
the invention step 134 also allows the user to obtain similar measures for the 
portfolios defined in step 131 under the scenario input in step 130. Such a measure 
for the portfolio preferably is obtained by performing a weighted average of the 
corresponding measures forthe assets included within the subject portfolio. 
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In step 136, any desired screening is performed. With respect to individual 
assets, such as individual stocks or individual commodities, this step is essentially 
identical to step 36 described above. However, this step preferably allows the user 
to search from among different portfolios as well as different individual assets, using 

5 any of the screening criteria described above for individual assets. 

In step 137, adjustments in the user's holdings are made based on the 
evaluation/screening data provided by steps 132, 134 and/or 136. This step is 
essentially identical to step 37 described above, but extended to include more 
portfolio-related transactions. Thus, a user may be allowed to supplement or modify 

10 the user's existing portfolio(s) or create one or more additional portfolios. As noted 
above, such actions can be fully automated or can be implemented with varying 
levels of participation from the user. 

In step 138, a determination is made as to whether an additional scenario is 
required to be analyzed. This step is essentially identical to step 38 described 

15 above. However, here the user has the option to alter not only the data values for 
the exogenous variables but also the composition of one or more portfolios. As a 
result, the user is provided with significant flexibility to project how various changes 
in his or her portfolio, as well as changes in external conditions, will affect the 
portfolio's value and/or the portfolio's exposures to various specific risks. 

20 Finally, in step 140 a determination is made as to whether the model needs 

to be updated. This step and the considerations pertaining thereto are essentially 
identical to step 40 described above. If the model does need to be updated, 
processing returns to step 122. 

In the foregoing embodiments of the invention, a price formula is calculated 

25 based on historical data for values for of an asset and historical data values for a 
number of exogenous variables, and then a measure of the tendency of the asset 
value to change as a result of changes in the exogenous variables is calculated from 
that price formula. It should be noted that it is also possible to directly calculate a 
return formula that expresses changes in the value of the asset as a function of 

30 changes in the data values for the exogenous variables. For instance, by initially 
inputting data values corresponding to changes in the value of the asset (e.g., either 
actual quantity changes or percentage changes) over some period of time (preferably 
a rolling period of time) and changes in the exogenous variables (e.g., quantity 



0248178,1 



19 



35512-00035 



changes or percentage changes) over the same period of time, a return formula that 
relates such price changes to such changes in the exogenous variables can be 
obtained, using eithera linear or non-linear regression ora neural network technique, 
in a similar manner to that described above. 

5 It is also possible to calculate the price or return formulas, and corresponding 

measures of tendency of asset value to change based on the exogenous variables, 
separately in different environments. For instance, it is possible to calculate one 
price formula using only data for periods of increasing inflation rates and another 
price formula calculated using only data for periods of decreasing inflation rates. In 

10 this way, for instance, it can be determined whether the elasticities of the asset 
values to inflation rates are symmetric (i.e., the same during periods of rising inflation 
as during periods of declining inflation) or, if not, how they differ. Alternatively, 
similar information could be obtained by including an additional variable that 
indicates the change in the rate of inflation. Such an additional variable could be 

15 binary (i.e., indicating either increasing or decreasing inflation rates) or could indicate 
the change in the inflation rate (either in terms of the quantity change in the inflation 
rate or in terms of the percentage change in the rate). 

I n addition to determining different environments in the foregoing deterministic 
manner, it also may be preferable, in certain circumstances, to dynamically define 

20 the different environments for which separate models are to be generated. For 
instance, after collecting historical data over the three previous years, such data may 
be subject to statistical cluster analysis (as described in more detail below). The 
resulting clusters may then be interpreted as distinct economic environments, for 
which different price or return models may be generated. The subsequent scenario- 

25 based processing will then use the model corresponding to the environment in which 
the input scenario falls. Utilizing separate scenarios in this fashion often may provide 
more accurate prediction and estimation results, because each model can be 
separately tailored to a unique environment and also because, at least in the case 
of a Taylor or Maclaurin series expansion, the dispersion of the historical data points 

30 around the expansion point often can be significantly reduced. It is noted that, 
generally, each such expansion point will be located at or near the center of the 
corresponding environment. 
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In this latter regard, it is noted that the location of the expansion point for a 
Taylor or Maclaurin series expansion generally will affect the accuracy of the 
resulting model. In addition to locating the expansion points as indicated above, the 
expansion point may be located at, or otherwise based on, the input scenario. 

5 Similarly, the expansion point may be located at, or otherwise based on, an 
independently generated prediction of the future environment, such as using a 
prediction made in accordance with any of the techniques described in co-pending 
applications 09/392,361, 09/391,765, 09/392,109, 09/391,962, 09/391,534, 
09/392,106, or09/391 ,764, filed September8, 1999, or09/494,200, filed January 28, 

10 2000. 

By using a large number of exogenous variables (e.g., at least 30 to 50) the 
price formulas (or return formulas) of the present invention often can approximate 
the reduced form of the actual value of (or return to) the asset, whatever that form 
may happen to be. As a result, it is preferable to use such a large number of 

15 variables in the technique of the present invention. Currently, it is most preferable 
to use approximately 35 exogenous variables. 

The above discussion frequently refers to the "value" of an asset. Generally, 
the value of an asset will be the price at which it is traded. However, other 
measurements of value may be used in addition to or in place of selling price. Such 

20 other measurements may be of particular importance, for example, when the subject 
asset is thinly traded, the subject asset frequently is traded in combination with other 
assets, or there exists any other factor that makes selling price an inappropriate 
indicator of the asset's value. As used herein, an "asset" may refer to a stock, a 
commodity, an index, a mutual fund, a derivative instrument whose value is based 

25 on the value, or on some other characteristic, of any of the foregoing, or any other 
item of value. 

The issue of statistical significance of the estimated measures of tendencies 
of the asset values to change based on changes in the exogenous variables is 
important. There may be numerous instances where there is no statistical 
30 significance to the estimated price formula and, consequently, no statistical 
significance to the estimated measure of tendency to change. 

One approach to this problem is to use a statistical significance threshold 
(e.g., as part of any screening). The statistical significance of each coefficient can 
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be tested using Student's t-test. Similarly, the statistical significance of groups of 
coefficients can be tested using the f-test. With respect to the latter, a number of 
groups may be defined, each group corresponding to a single exogenous variable 
and including the coefficients associated with all terms that include that exogenous 
5 variable. It is noted that in this example, if a second or higher order Taylor series 
expansion had been used, the existence of cross-product terms will mean that the 
defined groups will overlap. 

Alternatively, the group may include all coefficients used in the price formula. 
Then, any asset for which the identified coefficients have insufficient statistical 

10 significance (e.g., a p value exceeding some threshold, such as 5% significance) 
preferably would be excluded from the candidate pool for screening and generally 
would not be used for most other purposes in which the asset would be considered 
individually. However, in certain cases where aggregate statistics are to be 
calculated across multiple assets, the data for such an asset may be useful. 

15 In the foregoing estimation of statistical significance, the p value associated 

with any given t-test or f-test can be estimated with reference to a specified 
confidence interval for each of the subject coefficients. Alternatively, such 
confidence interval(s) can be specified and then the p value associated with such 
confidence interval(s) can be determined. For instance, it is possible to specify a 

20 confidence interval of ± 5% for each coefficient (i.e. , for each coefficient, the interval 
from 95% to 105% of the estimated value for such coefficient) and then determine 
the p value associated with such interval(s) (i.e., the probability that any of such 
coefficients is outside of the ± 5% confidence interval for its estimated value). 
Typically, such a probability will not be constant, but rather will depend upon the 

25 particular scenario (i.e., the input data values for the set of exogenous variables). 
For instance, the p value generally be significantly higher within a region of the 
exogenous variable space in which relatively little of the historical data used in 
creating the underlying model is located than in other regions where more such 
historical data points were located. Similarly, for a specified p value, the width of the 

30 confidence intervals typically will depend upon the particular scenario, with wider 
intervals tending to occur in regions of the exogenous variable space in which there 
was relatively little historical data input in step 122. 
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It is noted that the f-test can be applied to the price formula to determine the 
statistical significance of the value estimate or to the partial derivatives of the price 
formula to determine the statistical significance of the sensitivity, elasticity or similar 
measure. In the event that the f-test is applied to all coefficients in a formula, one 
5 can obtain a p value that corresponds to a specified confidence interval for the 
endogenous variable or, alternatively, a confidence interval for a specified p value. 
Thus, it is possible to calculate a p value for value of an asset within a specified 
confidence interval, a p value for a measure of the tendency of an asset value to 
change within a specified confidence interval, or corresponding confidence intervals 
10 given specified p values. 

In addition to evaluation and screening based on a single scenario, the 
present invention also contemplates evaluating and screening based on multiple 
different scenarios. For instance, the user may input a range of data values for one 
or more of the exogenous variables. In this case, the evaluation/screening tool of the 
15 present invention preferably samples the data values within each such range and 
combines the sampled data values to provide multiple different scenarios. After 
calculating asset values and tendencies of asset values to change for each such 
scenario, the evaluation/screening tool may output a range of asset values and a 
range of elasticities (or similar measures) for each asset. Such information may then 
20 be used as the basis for screening criteria. 

As will be observed from the above discussion, the asset evaluation/screening 
tool of the present invention can provide a user with a variety of information that can 
be directly used to maximize the value of the user's portfolio, while limiting the user's 
exposure to particular risk. For example, the user can alter the mix of his or her 
25 portfolio, input a projected scenario, view how the portfolio value and exposure to 
specific risks changes based on that projected scenario, search for optimal assets 
or combinations of assets under specified criteria, and then repeat this process for 
different portfolio compositions, different scenarios and/or different financial and 
economic criteria or goals. 

30 

Significance-Based Display . 

Once data have been generated by the evaluation/screening tool of the 
present invention, it often will be desirable to display some or all of such data to the 
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end user. For example, it may be desirable to display data concerning the elasticities 
of various assets to the rate of inflation during a given scenario. Conventionally, 
information may be displayed in several different ways. For instance, it is possible 
to display such information in a tabular format or in a graphical format. In the 
5 evaluation/screening tool of the present invention, due to the large amount of 
information that must be presented simultaneously, it is preferable to display at least 
some of the information graphically. 

For instance, the elasticity data in the above example preferably is displayed 
in a bar graph format, with each different asset corresponding to a point on the x axis 
10 and the elasticity for each asset being represented by a bar whose height 
corresponds to the magnitude of the elasticity, and with the bar originating at y=0 and 
extending upward for positive elasticity and downward for negative elasticity. In 
addition to indicating the magnitude and direction of the elasticity for each asset, the 
display according to the preferred embodiment of the present invention also indicates 
15 the statistical significance of the elasticity for each asset. More preferably, the 
intensity at which the bar for each asset is displayed preferably is a function of the 
statistical significance of the calculated elasticity for that asset. Such a display is 
illustrated in Figure 3. 

Specifically, Figure 3 illustrates a bar graph according to the present invention. 
20 In Figure 3, each bar corresponds to a different asset (e.g., stock) and the height (or 
length) of the bar is proportional to the asset's elasticity to a specified exogenous 
variable (e.g., the Federal Funds Rate). It should be noted that the height of each 
bar may instead be any other function of the elasticity for the corresponding asset, 
although preferably that function is the same for all assets that are displayed at the 
25 same time. Also, although elasticities for various assets are displayed in Figure 3, 
any other measure of a tendency of an asset's value to change based on a change 
in an exogenous variable may instead be displayed. For simplicity, the following 
discussion will continue to refer to elasticities, it being understood any other such 
measure of tendency to change may be substituted therefor. 
30 As discussed above, the statistical significance for each elasticity calculation 

can be determined, such as by applying the f-test to the coefficients of the elasticity 
formula, which in turn may be derived from the price formula. The resulting p value 
provides a measure of the statistical significance of the calculated elasticity. As also 
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noted above, the p value may be tied to a confidence interval for the determined 
elasticity or to a set of confidence intervals for the coefficients used in the elasticity 
formula. Similarly, the p value may be dependent upon the point in the exogenous 
variable space (i.e., the particular input scenario). In the preferred embodiment of 
5 the invention, the p values are calculated with respect to similar confidence intervals 
across all assets to be displayed and at the same input scenario, such that 
comparisons between the p values will be meaningful. 

The bars 180 to 183 in Figure 3 reflect the different statistical significances of 
the different assets by being displayed in different intensities, the intensity of each 
10 such bar being a function of the statistical significance of the corresponding asset. 
For instance, the intensity at which a bar is displayed might be equal to 1 minus the 
p value for the corresponding asset, where intensity ranges from 0 (meaning that the 
bar is not displayed at all) to 1 (meaning maximum intensity). In Figure 3, intensity 
is illustrated by the density of the horizontal lines within a bar. Thus, bar 180 is 
15 displayed at a high intensity, indicating that the calculated elasticity of the 
corresponding asset is highly statistically significant (e.g., a p value of 0.05). By 
contrast, bar 1 81 is displayed very lightly, indicating a very low statistical significance 
(e.g., a p value of 0.90). Between these extremes are bars 182 and 183, which 
indicate intermediate levels of statistical significance. 
20 Using such a linear relationship between display intensity and statistical 

significance may be desirable in certain embodiments. However, in other 
embodiments it may be more desirable to highlight certain differences more than 
others. For instance, if one is only interested in very significant data, it may be more 
desirable to non-linearly map the p value (or other measure of statistical significance) 
25 to intensity such that more intensity levels are used in the high end of statistical 
significance (e.g., around p values near 0) than at the low end of statistical 
significance (e.g., around p values near 1). 

Also, although a bar graph is utilized in the foregoing example, it should be 
understood that the technique of varying display intensity levels based on the 
30 statistical significance of the data being displayed can be beneficially used in other 
graphical display methods as well. Such other graphical display methods include 
simply plotting individual data points on a graph, graphical techniques in which a line 
or curve is interpolated between each adjacent pair of data points so as to indicate 
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a continuously changing endogenous variable, and any other graphical display 
method. Similarly, such techniques may be applied in any other situation where data 
to be displayed have been estimated and have an associated statistical significance. 
The actual measure of statistical significance preferably depends upon the type of 
5 data to be displayed, and may include, for example, standard deviation, variance, 
correlation coefficient, and/or any function of the foregoing, in addition to p value. 

The variation in display intensity required for the present invention can be 
accomplished using known techniques. When using a display device having variable 
intensity, such as a cathode ray tube (CRT) display, the required intensity is simply 
10 specified for each display point. When using a monochrome display device, such 
as certain liquid crystal displays or many printing techniques, the appearance of 
varying intensity can be provided by using halftoning, error diffusion or other known 
techniques. 

As indicated above, in the preferred embodiments of the invention, statistical 

15 significance is displayed by changing the intensity of the displayed data points as a 
function of their statistical significance. However, it is also possible to graphically 
indicate statistical significance in other ways as well, including other ways in which 
statistical significance is indicated without requiring a separate coordinate for it on 
the graph. For instance one could vary the size of a displayed data point, the width 

20 of each bar in a bar graph, or the width of line and/or curve segments in a chart 
graph as a function of the statistical significance of the corresponding data points. 
Alternatively, one could vary the hue, saturation, brightness or any other display 
characteristic of the displayed points as a function of statistical significance. For 
example, colors at the red end of the color spectrum might indicate low statistical 

25 significance while colors at the violet end of the color spectrum would indicate high 
statistical significance, or vice versa. As used herein, "display characteristic" is 
intended to mean the way in which a data point is displayed, rather than the position 
at which it is displayed. With any of such alternate display techniques, as well as the 
preferred intensity-based technique, the display characteristic (e.g., size, width or 

30 color property) may be related to the statistical significance by any linear or non- 
linear function. 
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Identifying Industry Sectors Using Clusterization . 

Once asset sensitivities, elasticities or other measures of tendency to change 
with respect to a number of different exogenous variables have been calculated, 
such as pursuant to the techniques described above, such measures can then be 
5 used to identify true industry sectors using conventional clusterization techniques. 
For example, assume that there exists a collection of stocks, such as several 
hundred or several thousand different stocks, that are to be assigned to industry 
sectors. Assume further that elasticities have been calculated for each such stock 
with respect to each of a number of different exogenous variables (e.g., between 30 

10 and 50 such variables). In order to initially simplify the discussion, it is also assumed 
that the elasticities of each asset value to each exogenous variable is constant, such 
as may have been obtained by performing a multi-variate linear regression. In such 
a case, utilizing cluster analysis, a standard statistical grouping method, in an 
innovative manner, the present invention is able to identify relevant sectors and 

15 simultaneously assign the various stocks into those sectors. Accordingly, the 
problems with conventional sectoral analysis, sectoral definition and asset 
classification, are solved simultaneously. 

Cluster analysis algorithms (such as are available in Systat and numerous 
other multi-variate statistics computer programs) attempt to group the data into 

20 clusters such that the measured distance between individual data points within each 
cluster is a minimum, but also such that the measured distance between any two 
clusters is maximized. In other words, cluster analysis attempts to group data points 
so that the groups are as much alike as they can reasonably be, but also so the 
groups are as reasonably different from other groups as they can be. There are 

25 numerous standard methods for clustering data which could be employed, including: 
discrimination functions, factor analysis, and grouping techniques such as iterated 
Chi-Square and maximum-distance measures. 

A preferred embodiment of the invention uses the KMEANS statistical 
procedure, included in statistical packages such as SYSTAT and the S+ statistical 

30 modeling language. The KMEANS algorithm splits N assets into groups by 
maximizing the between-group distance and minimizing the within-group distance. 
It is noted that there are numerous possible distance measures which could be used, 
such as Pearson Product Moment Correlation, Sum of Squared Deviations, and 
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Rsquared (1 - Squared Pearson Product Moment Correlation), or the Minkowski 
distance, the z-th root of the mean z-th powered coordinate distance, e.g., with an 
initial parameter z = 2. 

The cluster analysis of the present invention may be performed over the entire 
5 set of exogenous variables or over any subset thereof. By defining each resulting 
cluster to be a sector, the present invention automatically provides sector definitions. 
Moreover, because the sectors are formed by clustering assets having similar 
elasticities (or other measures of tendency to change based on changes in the 
exogenous variables), it is more likely than in conventional techniques that the stocks 

10 in each sector do in fact respond similarly to market conditions or, more accurately, 
to the set of conditions represented by the exogenous variables used. 

After having obtained sectors and sector assignments in this manner, 
aggregate statistics for each sector can be calculated and monitored over time in 
order to assess changes in various industries and to utilize such changes to predict 

15 changes in other industries, as well as in various macroeconomic data. Such 
aggregate statistics might include, for example, total gross revenues, total profits, 
total employment, average profit margin, total market capitalization, total inventory 
as well as changes in the foregoing data. Based on the predictions derived from 
such data, assets maybe purchased or sold. For instance, declining profitability and 

20 increasing inventory in a sector that includes a significant number of computer 
hardware manufacturers might signal a future decrease in demand for computer 
chips, prompting one to sell stock in computer chip manufacturers. 

Preferably, the elasticities for the stocks in the current example will have been 
determined by using data over some fixed interval of time. By recalculating such 

25 elasticities on a rolling basis, one can observe how assets move both relative to their 
clusters and among clusters overtime. Any such changes might signal, for example, 
a change in the direction or management of the underlying company, a change in a 
company's methods or technology that is making that company's business more or 
less dependent on a particular input to production (e.g., a particular type of labor or 

30 raw materials), or even a diversification by the company into other types of business 
that are affected by different conditions. In addition, one may observe how the 
sector definitions themselves change overtime, indicating potential changes in an 
entire industry. 
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In a somewhat more complicated example, assume that the elasticities (or 
other measures of tendency of asset value to change as of result of changes in the 
exogenous variables) are expressed as a function of the exogenous variables. This 
generally will be the case, for example, where the price function or return function 
5 has been determined using a non-linear regression or a neural network technique. 
In this case, the assets can be clustered using the foregoing technique and inputting 
current data values for the exogenous variables (i.e., using the current scenario). 
Alternatively, the assets might be clustered over multiple scenarios, such as by 
sampling the elasticities using such multiple scenarios and then clustering on the 

10 basis of all resulting data. Such multiple scenarios might be selected, for example, 
to include the current scenario and group of scenarios in the neighborhood of the 
current scenario. Still further, separate clustering might be performed for each such 
separate scenario and then the resulting sectors compared across different 
scenarios. Also, as with the example described above, the sectors may be 

15 recomputed on a rolling basis and changes in both the assets and the sector 
definitions observed overtime. 

The various techniques described above may be used in any or all possible 
combinations, depending upon the data needs of the end user. Common to all such 
embodiments, however, is the grouping of assets based on similarities of their 

20 tendencies to change in value as the result of changes in a set of exogenous 
variables. The most common application of this aspect of the invention is for use in 
defining business sectors and for classifying stocks into those sectors. However, the 
techniques described above may be used on connection with any other types of 
assets as well. By grouping assets in this manner, the present invention provides the 

25 basis for predicting future changes in both asset values and macroeconomic 
variables. Such data and predictions can be directly incorporated into existing and 
future models for selecting stocks and other assets to purchase and sell, thereby 
having direct application to asset portfolio management and financial planning. In 
fact, many existing models incorporate sectoral statistics for just such purposes. The 

30 results of this aspect of the present invention can be used beneficially in such 
models. Moreover, because sectoral analysis of the present invention overcomes 
many of the problems of conventional sectoral analysis techniques, substitution 
using the results of the present technique often will provide more accurate 
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information, thus permitting those models to provide more effective buy/sell 
strategies. 

Design System Environment . 
5 Generally, the methods described herein will be practiced with a general- 

purpose computer, either with a single processor or multiple processors. Figure 4 
is a block diagram of a general-purpose computer system, representing one of many 
suitable computer platforms for implementing the methods described above. Figure 
4 shows a general-purpose computer system 250 in accordance with the present 
10 invention. As shown in Figure 4, computer system 250 includes a central processing 
unit (CPU) 252, read-only memory (ROM) 254, random access memory (RAM) 256, 
expansion RAM 258, input/output (I/O) circuitry 260, display assembly 262, input 
device 264, and expansion bus 266. Computer system 250 may also optionally 
include a mass storage unit 268 such as a disk drive unit or nonvolatile memory such 
15 as flash memory and a real-time clock 270. 

CPU 252 is coupled to ROM 254 by a data bus 272, control bus 274, and 
address bus 276. ROM 254 contains the basic operating system for the computer 
system 250. CPU 252 is also connected to RAM 256 by busses 272, 274, and 276. 
Expansion RAM 258 is optionally coupled to RAM 256 for use by CPU 252. CPU 
20 252 is also coupled to the I/O circuitry 260 by data bus 272, control bus 274, and 
address bus 276 to permit data transfers with peripheral devices. 

I/O circuitry 260 typically includes a number of latches, registers and direct 
memory access (DMA) controllers. The purpose of I/O circuitry 260 is to provide an 
interface between CPU 252 and such peripheral devices as display assembly 262, 
25 input device 264, and mass storage 268. 

Display assembly 262 of computer system 250 is an output device coupled 
to I/O circuitry 260 by a data bus 278. Display assembly 262 receives data from I/O 
circuitry 260 via bus 278 and displays that data on a suitable screen. 

The screen for display assembly 262 can be a device that uses a cathode-ray 
30 tube (CRT), liquid crystal display (LCD), or the like, of the types commercially 
available from a variety of manufacturers. Input device 264 can be a keyboard, a 
mouse, a stylus working in cooperation with a position-sensing display, or the like. 
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The aforementioned input devices are available from a variety of vendors and are 
welt known in the art. 

Some type of mass storage 268 is generally considered desirable. However, 
mass storage 268 can be eliminated by providing a sufficient mount of RAM 256 and 

5 expansion RAM 258 to store user application programs and data. In that case, 
RAMs 256 and 258 can optionally be provided with a backup battery to prevent the 
loss of data even when computer system 250 is turned off. However, it is generally 
desirable to have some type of long term mass storage 268 such as a commercially 
available hard disk drive, nonvolatile memory such as flash memory, battery backed 

10 RAM, PC-data cards, or the like. 

A removable storage read/write device 269 may be coupled to I/O circuitry 
260 to read from and to write to a removable storage media 271. Removable 
storage media 271 may represent, for example, a magnetic disk, a magnetic tape, 
an opto-magnetic disk, an optical disk, or the like. Instructions for implementing the 

15 inventive method may be provided, in one embodiment, to a network via such a 
removable storage media. 

In operation, information is input into the computer system 250 by typing on 
a keyboard, manipulating a mouse or trackball, or "writing" on a tablet or on 
position-sensing screen of display assembly 262. CPU 252 then processes the data 

20 under control of an operating system and an application program, such as a program 
to perform some or all of the steps of the inventive methods described above, stored 
in ROM 254 and/or RAM 256. It is noted that such process steps may initially be 
stored in mass storage 268, downloaded into RAM 256 and then executed out of 
RAM 256. CPU 252 then typically produces data which is output to the display 

25 assembly 262 to produce appropriate images on its screen. 

Expansion bus 266 is coupled to data bus 272, control bus 274, and address 
bus 276. Expansion bus 266 provides extra ports to couple devices such as network 
interface circuits, modems, display switches, microphones, speakers, etc. to CPU 
252. Network communication is accomplished through the network interface circuit 

30 and an appropriate network. 

Suitable computers for use in implementing the present invention may be 
obtained from various vendors. Various computers, however, may be used 
depending upon the size and complexity of the tasks. Suitable computers include 
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mainframe computers, multiprocessor computers, workstations or personal 
computers. In addition, although a general purpose computer system has been 
described above, a special-purpose computer may also be used. 

It should be understood that the present invention also relates to machine 

5 readable media on which are stored program instructions for performing some or all 
of the methods of this invention. Such media include, by way of example, magnetic 
disks, magnetic tape, optically readable media such as CD ROMs, semiconductor 
memory such as PCMCIA cards, etc. In each case, the medium may take the form 
of a portable item such as a small disk, diskette, cassette, etc., or it may take the 

10 form of a relatively larger or immobile item such as a hard disk drive or RAM 
provided in a computer. 

Conclusion . 

Although the present invention has been described in detail with regard to the 
15 exemplary embodiments and drawings thereof, it should be apparent to those skilled 

in the art that various adaptations and modifications of the present invention may be 

accomplished without departing from the spirit and the scope of the invention. 

Accordingly, the invention is not limited to the precise embodiments shown in the 

drawings and described in detail above. Rather, it is intended that all such variations 
20 not departing from the spirit of the invention be considered as within the scope 

thereof as limited solely by the claims appended hereto. 

Also, several different embodiments of the present invention are described 

above, with each such embodiment described as including certain features. 

However, it is intended that the features described in connection with the discussion 
25 of any single embodiment are not limited to that embodiment but may be included 

and/or arranged in various combinations in any of the other embodiments as well, 

as will be understood those skilled in the art. 
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CLAIMS 

What is claimed is: 

1 . A method for displaying information, said method comprising: 
obtaining a plurality of data points, each said data point including an estimated 

statistic; 

obtaining a measure of statistical significance for each said estimated statistic; 

and 

displaying a graph of said plurality of data points, 

wherein each said data point is displayed at an intensity level that is a function 
of the measure of statistical significance of the estimated statistic included in said 
data point. 

2. A method according to Claim 1 , wherein each said data point includes 
an identification of an asset and a measure of a tendency of a value of the asset to 
change as a result of a change in a data value for an exogenous variable. 

3. A method according to Claim 2, wherein said data points are displayed 
in a bar graph that includes a separate bar for each asset. 

4. A method according to Claim 3, wherein each said bar is displayed at 
an intensity level that is a function of the measure of statistical significance of the 
measure of the tendency of the value of the asset corresponding to said bar to 
change. 

5. A method according to Claim 4, wherein a height of each said bar is a 
second function of the measure of the tendency of the value of the asset to change 
as a result of a change in the data value for the exogenous variable. 

6. A method according to Claim 1, wherein the estimated statistic was 
estimated using a regression equation, and wherein the measure of statistical 
significance is a p value that was calculated from the regression equation. 
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7. A method according to Claim 1 , wherein the function is linear. 

8. A method according to Claim 1 , wherein the function is non-linear. 

9. A method according to Claim 1, wherein each said data point is 
displayed as a bar in a bar graph. 

10. A method according to Claim 1 , wherein said statistical significance is 
an estimate of a probability that an actual value for said estimated statistic is outside 
of a specified confidence interval around an estimated value for said estimated 
statistic. 

11. A method according to Claim 10, wherein calculation of the intensity 
for each said data point comprises determining 1 minus said estimate of probability. 

12. A method for displaying information, said method comprising: 
obtaining a plurality of data points, each said data point including an estimated 

statistic; 

obtaining a measure of statistical significance for each said estimated statistic; 

and 

displaying a graph of said plurality of data points, 

wherein a display characteristic of each said data point is a function of the 
measure of statistical significance of the estimated statistic included in said data 
point. 

1 3. A method according to Claim 1 2, wherein said display characteristic is 
a size of said each data point. 

14. A method according to Claim 1 , wherein said display characteristic is 
a hue at which said each data point is displayed. 

1 5. A method according to Claim 1 , wherein said display characteristic is 
a saturation at which said each data point is displayed. 
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16. A method according to Claim 1 , wherein said display characteristic is 
a brightness at which said each data point is displayed. 

1 7. A method according to Claim 1 , wherein said display characteristic is 
a color characteristic with which said each data point is displayed. 

18. A method according to Claim 1, wherein each said data point is 
displayed as a bar in a bar graph. 

19. An apparatus for displaying information, said apparatus comprising: 
means for obtaining a plurality of data points, each said data point including 

an estimated statistic; 

means for obtaining a measure of statistical significance for each said 
5 estimated statistic; and 

means for displaying a graph of said plurality of data points, 

wherein each said data point is displayed at an intensity level that is a function 

of the measure of statistical significance of the estimated statistic included in said 

data point. 

20. An apparatus for displaying information, said apparatus comprising: 
means for obtaining a plurality of data points, each said data point including 

an estimated statistic; 

means for obtaining a measure of statistical significance for each said 
5 estimated statistic; and 

means for displaying a graph of said plurality of data points, 

wherein a display characteristic of each said data point is a function of the 

measure of statistical significance of the estimated statistic included in said data 

point. 

21. A computer-readable medium storing computer-executable process 
steps for displaying information, said process steps comprising steps to: 

obtain a plurality of data points, each said data point including an estimated 
statistic; 
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5 obtain a measure of statistical significance for each said estimated statistic; 

and 

display a graph of said plurality of data points, 

wherein each said data point is displayed at an intensity level that is a function 
of the measure of statistical significance of the estimated statistic included in said 
10 data point. 

22. A computer-readable medium storing computer-executable process 
steps for displaying information, said process steps comprising steps to: 

obtain a plurality of data points, each said data point including an estimated 
statistic; 

5 obtain a measure of statistical significance for each said estimated statistic; 

and 

display a graph of said plurality of data points, 

wherein a display characteristic of each said data point is a function of the 
measure of statistical significance of the estimated statistic included in said data 
10 point. 
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ABSTRACT OF THE DISCLOSURE 

Provided is a technique for displaying information. Initially, a number of data 
points is obtained, each data point including an estimated statistic. A measure of the 
5 statistical significance for each estimated statistic also is obtained, and a graph of 
the data points is displayed. According to this aspect of the invention, a display 
characteristic (such as intensity, size, brightness, hue, saturation or any other color 
characteristic) of each data point is a function of the measure of statistical 
significance of the estimated statistic included in such data point. 
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